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ABSTRACT 

Knowledge of the supernova (SN) delay time distribution (DTD) - the SN rate versus time that 
would follow a hypothetical brief burst of star formation - can shed light on SN progenitors and 
physics, as well as on the timescales of chemical enrichment in different environments. We compile 
recent measurements of the Type-la SN (SN la) rate in galaxy clusters at redshifts from z = out 
to z = 1.45, just 2 Gyr after cluster star formation at z ~ 3. We review the plausible range for the 
observed total iron-to-stellar mass ratio in clusters, based on the latest data and analyses, and use 
it to constrain the time-integrated number of SN la events in clusters. With these data, we recover 
the DTD of SNe la in cluster environments. The DTD is sharply peaked at the shortest time-delay 
interval we probe, < t < 2.2 Gyr, with a low tail out to delays of ~ 10 Gyr, and is remarkably 
consistent with several recent DTD reconstructions based on different methods, applied to different 
environments. We test DTD models from the literature, requiring that they simultaneously reproduce 
the observed cluster SN rates and the observed iron-to-stellar mass ratios. A parametrized power-law 
DTD of the form i- 12±0 - 3 from t = 400 Myr to a Hubble time, can satisfy both constraints. Shallower 
power laws, such as i -1 / 2 cannot, assuming a single DTD, and a single star-formation burst (either 
brief or extended) at high z. This implies 50-85% of SNe la explode within 1 Gyr of star formation. 
DTDs from double-degenerate (DD) models, which generically have ~ t^ 1 shapes over a wide range 
of timescales, match the data, but only if their predictions are scaled up by factors of 5 — 10. Single 
degenerate (SD) DTDs always give poor fits to the data, due to a lack of delayed SNe and overall 
low numbers of SNe. The observations can also be reproduced with a combination of two SN la 
populations - a prompt SD population of SNe la that explodes within a few Gyr of star formation, 
and produces about 60 percent of the iron mass in clusters, and a DD population that contributes the 
events seen at z < 1.5 An alternative scenario of a single, prompt, SN la population, but a composite 
star-formation history in clusters, consisting of a burst at high z, followed by a constant star-formation 
rate, can reproduce the SN rates, but is at odds with direct measurements of star formation in clusters 
at < z < 1. Our results support the existence of a DD progenitor channel for SNe la, if the overall 
predicted numbers can be suitably increased. 

Subject headings: supernovae: general - galaxies: clusters: general 



1. INTRODUCTION 

Supernovae (SNe) play a central role in astrophysics, 
not only as distance indicators for cosmology (e.g., Riess 
et al. 1998; Perlmutter et al. 1999), but as prime syn- 
thesizers of heavy elements (e.g. Woosley 2007), sources 
of kinetic energy, and accelerators of cosmic rays (e.g. 
Helder et al. 2009). However, many of the most ba- 
sic facts about these events are still poorly understood. 
Core-collapse (CC) SNe are descended from massive 
stars, roughly in the range 8 — 50 M Q (e.g. Gal- Yam 
& Leonard 2009), but the exact limits are not known, 
neither from theory nor from observation. It is not yet 
clear what are the physical parameters (mass, binarity, 
rotation, magnetic field, and more) that determine the 
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variety of observed CC-SN subtypes (IIP, lb, Ic, Iln, ILL; 
see Filippenko 1997, for a review). Type-la SNe (SNe la) 
are linked by indirect evidence to the thermonuclear det- 
onations of carbon-oxygen white dwarfs (WDs) whose 
mass has grown to near the Chandrasekhar limit (Hoyle 
& Fowler 1960). However, competing scenarios exist for 
the initial conditions and evolutionary paths that lead to 
this mass growth. In the single degenerate (SD) model 
(Whelan & Iben 1974) a WD grows in mass through ac- 
cretion from a non-degenerate stellar companion. In the 
double degenerate (DD) model (Iben & Tutukov 1984; 
Webbink 1984), two WDs merge after losing energy and 
angular momentum to gravitational waves. In both sce- 
narios, many question remain regarding the ignition and 
development of the explosion itself. It is only recently 
that the relative and absolute rates of CC SNe and SNe la 
as a function of environment and cosmic time are starting 
to be measured accurately, and therefore the respective 
quantity, types, and times of their contributions to metal 
enrichment history are still poorly constrained. 

A fundamental function that could shed light on all 
of these issues is the SN delay time distribution (DTD). 
The DTD is the hypothetical SN rate versus time that 
would follow a brief burst of star formation. The DTD is 
directly linked to the lifetimes (i.e., the initial masses) of 
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the progenitors and to the binary evolution timescales up 
to the explosion, and therefore different progenitor sce- 
narios predict different DTDs. Furthermore, the DTDs 
of different SN types dictate directly the mix of contribu- 
tions of different SN types to metal enrichment through- 
out cosmic history. Until recently, only few, and often- 
contradictory, observational constraints on the DTD ex- 
isted. One observational approach has been to compare 
the SN rate in field galaxies, as a function of redshift, 
to the cosmic star formation history (SFH). Given that 
the DTD is the SN "response" to a short burst of star 
formation, the SN rate versus cosmic time will be the 
convolution of the full SFH with the DTD. Gal- Yam & 
Maoz (2004) carried out the first such comparison, using 
a small sample of SNe la out to z = 0.8, and concluded 
that the results were strongly dependent on the poorly 
known cosmic SFH, a conclusion echoed by Forster et al. 

(2006) . 

With the availability of SN rate measurements to 
higher redshifts, Barris & Tonry (2006) found a SN la 
rate that closely tracks the SFH out to z ~ 1, and con- 
cluded that the DTD must be concentrated at short de- 
lays, ^ 1 Gyr. Similar conclusions have been reached, at 
least out to z <~ 0.7, by Sullivan et al. (2006). In con- 
trast, Dahlen et al. (2004, 2008) and Strolger (2004) have 
argued for a DTD that is peaked at a delay of ~ 3 Gyr, 
with little power at short delays, based on a decrease in 
the SN la rate at z > 1. However, Kuznetzova et al. 

(2007) have re-analyzed some of these datasets and con- 
cluded that the small numbers of SNe and their potential 
classification errors preclude reaching a conclusion. Sim- 
ilarly, Poznanski et al. (2007) performed new measure- 
ments of the z > 1 SN la rate, and found that, within 
uncertainties, the SN rate could be tracking the SFH. 
This, again, would imply a short delay time. Greggio et 
al. (2008) pointed out that underestimated extinction of 
the highest-z SNe, observed in their rest-frame ultravio- 
let emission, could be an additional factor affecting these 
results. 

A second approach to recovering the DTD has been 
to compare the SN rates in galaxy populations of differ- 
ent characteristic ages. Using this approach, Mannucci 
et al. (2005, 2006), Scannapieco & Bildstcn (2005), and 
Sullivan (2006) all found evidence for the co-existence 
of two SN la populations, a "prompt" population that 
explodes within ~ 100 — 500 Myr, and a delayed chan- 
nel that produces SNe la on timescales of order 5 Gyr. 
Naturally, these two "channels" may in reality be just 
integrals over a continuous DTD on two sides of some 
time border (Greggio et al. 2008). Totani et al. (2008) 
have used a similar approach to recover the DTD, by 
comparing SN la rates in early-type galaxies of differ- 
ent characteristic ages, seen at z = 0.4 — 1.2 as part of 
the Subaru/XMM-Newton Deep Survey (SXDS) project. 
They find a DTD consistent with a i -1 form, which is 
roughly generic for DD models (e.g. Greggio 2005; see 
§5.1, below). Additional recent attempts to address the 
issue with the "rate vs. age" approach have been made 
by Aubourg et al. (2008), Raskin et al. (2009), Yasuda 
& Fukugita (2009), and Cooper et al. (2009). 

Both of the above approaches involve averaging, and 
hence some loss of information. In the first approach, 
one averages over large galaxy populations, by associ- 
ating all of the SNe detected at a given redshift with 



all the galaxies of a particular type at that redshift. In 
the second approach, a characteristic age for a sample 
of galaxies replaces the detailed SFH of the individual 
galaxies in a SN survey. Maoz et al. (2010) recently pre- 
sented a method for recovering the DTD, which avoids 
this averaging. In the method, the SFH of every individ- 
ual galaxy, or even galaxy subunit, is compared to the 
number of SNe it hosted in the survey (generally none, 
sometimes one, rarely more). DTD recovery is treated 
as a linearized inverse problem, which is solved statis- 
tically. Maoz et al. (2010) applied the method to a 
subsamplc of the galaxies in the Lick Observatory SN 
Search (Filippenko et al. 2010; Leaman et al. 2010; Li 
et al. 2010a, b), having SFH reconstructions by Tojciro 
et al. (2009) based on data from the Sloan Digital Sky 
Survey (SDSS; York et al, 2000). Maoz et al. (2010) 
find a significant detection of both a prompt SN la com- 
ponent, that explodes within 420 Myr of star formation, 
and a delayed SN la with population that explodes af- 
ter > 2.4 Gyr . Maoz & Badenes (2010) applied this 
method also to a sample of SN remnants in the Mag- 
ellanic Clouds, compiled by Badenes, Maoz, & Draine 
(2010). Treating the remnants as an effective SN sur- 
vey conducted over ~ 20 kyr, they also find a significant 
detection of a prompt (this time < 330 Myr) SN la com- 
ponent. A related DTD reconstruction method has been 
applied by Brandt et al. (2010) to the SNe la from the 
SDSS II survey. Like Maoz et al. (2010), they detect 
both a prompt and a delayed SN la population. 

Yet another approach for recovering the DTD, which 
is at the focus of this paper, is to measure the SN rate 
vs. redshift in massive galaxy clusters. The deep po- 
tential wells of clusters, combined with their relatively 
simple SFHs, make them ideal locations for studying 
both the DTD and the metal production of SNe. Optical 
spectroscopy and multiwavelength photometry of cluster 
galaxies has shown consistently that the bulk of their 
stars were formed within short episodes (~ 100 Myr) at 
z ~ 3 (e.g., Daddi et al. 2000; Stanford et al. 2005; 
Eisenhardt et al. 2008). Thus, the observed SN rate 
vs. the elapsed cosmic time since the stellar formation 
epoch provides an almost (see §5) direct measurement of 
the form of the DTD. Furthermore, the record of met- 
als stored in the intracluster medium (ICM) constrains 
the number of SNe that have exploded, and hence the 
normalization of the DTD. 

Renzini et al. (1993) and Renzini (1997) first pointed 
out that the large mass of iron in the ICM could not have 
been produced solely by CC SNe from a stellar popula- 
tion with a standard initial mass function (IMF) . Either a 
"top-heavy" IMF, or a dominant contribution by SNe la 
is required. Maoz & Gal- Yam (2004) revisited the prob- 
lem, and tested the hypothesis that the main source of 
iron is SNe la. Accounting for the CC SN contribution 
to the observed iron mass, they calculated the number 
of SNe la, per present-day stellar luminosity, needed to 
have exploded over the entire past history of a cluster, in 
order to produce the observed iron mass. Finally, they 
compared some simple theoretical DTDs, normalized to 
produce this number of SNe, to the then-available clus- 
ter SN la rates (Gal- Yam, Maoz, & Sharon 2002; Reiss 
2000). The low observed SN la rates out to z ~ 1 implied 
that the large number of events, needed to produce the 
bulk of the iron, occurred at even higher redshifts, be- 
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yond the range of the then-existing observations. Maoz 
& Gal- Yam (2004) therefore concluded that SNe la can 
be the main source of iron only if most of them explode 
during the relatively brief time interval between star for- 
mation in massive clusters (at z ~ 2 — 3) and the highcst- 
rcdshift cluster SN rate measurements (at z ~ 1). In 
other words, the majority of SNe la (at least those that 
occur in a galaxy cluster environment) must have a short 
time delay, < 2 Gyr. 

A shortcoming of the work by Maoz & Gal- Yam (2004) 
was the paucity and quality of the cluster SN rate data it 
was based on. The low-z cluster SN rate by Reiss (2000) 
was never published in the refereed literature. The clus- 
ter SN rates at z = 0.25 - 0.9 by Gal- Yam ct al. (2002), 
were based on very few events - two or three - discov- 
ered in archival Hubble Space Telescope (HST) WFPC2 
images of clusters. The large Poisson and systematic un- 
certainties in the resulting rates precluded any detailed 
discrimination among model DTDs. 

Over the past few years, the observational situation 
has improved dramatically. Accurate cluster SN la rates 
have been measured in the redshift range of to 1.5, with 
smaller errors than the previous sole published study at 
<z>= 0.25,0.90 by Gal- Yam et al. (2002). In this pa- 
per, we utilize this wealth of new measurements for a 
renewed analysis of the cluster SN rate and its implica- 
tions for the SN la DTD and for cluster metal-enrichment 
history. We show that the new data permit a direct 
comparison of the observations with the functional be- 
havior of various DTDs that have been proposed in the 
literature, whether parametrized-mathematical or phe- 
nomenological DTDs, or DTDs resulting from physical 
considerations at different levels of sophistication. We 
find that few of these models, in their simplest forms, 
are compatible with the emerging observational picture, 
and therefore we have reached the point where the ob- 
servations discriminate among models. In §2, below, we 
compile the existing cluster SN la measurements. In §3, 
we review the existing measurements of the properties of 
clusters, particularly the gas-to-stellar mass ratio, that 
are relevant for estimating the present-day cluster iron- 
to-stellar mass ratio. This ratio, in turn, sets the number 
of SN la that have exploded in clusters, per unit stel- 
lar mass formed, which then fixes the normalization of 
the DTD. In §4 we use the rates and the normalization 
to recover the observational DTD. We take a forward- 
modeling approach starting in §5, where we compare 
the observed rates to the DTD predictions of various 
"single-component" models that invoke a single math- 
ematical form or physical progenitor formation channel. 
In §6, we relax the assumption of a single, instantaneous, 
star-formation episode in clusters, and examine whether 
an extended star-formation history, combined with some 
single DTD, can reproduce the measurements. In §7, we 
examine the ability of composite models, which mix di- 
verse DTDs, or multiple episodes of star formation, to 
match the observations. We summarize our results in 
§8. Throughout the paper we assume a cosmology with 
parameters Qa = 0.7, TO = 0.3, and H = 70 km s _1 
Mpc" 1 . 

2. CLUSTER SN IA RATES 
2.1. Existing cluster SN rates 



To perform our analysis, we first compile the cluster 
SN la rate measurements currently in existence. The re- 
cent measurements are at mean redshifts of <z>= 0.02 
(Mannucci et al 2008); <z>= 0.15 (Sharon et al. 2007; 
Gal- Yam et al. 2008); <z>= 0.08,0.23 (Dilday et al. 
2010); <z>= 0.46 (Graham et al. 2008); <z>= 0.60 
(Sharon et al. 2010); and <z>= 1.12 (Barbary et al. 
2010). The mean redshifts are visibility-time- weighted 
means for the clusters monitored by each survey, i.e., in 
the calculation of the mean, the redshift of each cluster in 
the survey sample is weighted by the effective time during 
which a SN la would have been visible in the survey (see, 
e.g., Sharon et al. 2007). Before these recent measure- 
ments, the sole modern published cluster rates were at 
<z>= 0.25,0.90 (Gal- Yam et al. 2002). Table 1 summa- 
rizes, for each of these measurements, the mean redshift 
of the sample, the redshift range, the cosmic times cor- 
responding to these redshifts in our assumed cosmology, 
and the SN la rate, normalized by stellar mass. The 
stellar masses, at the cosmic time corresponding to the 
redshifts of the clusters, have generally been estimated 
consistently in the respective papers, based on the ob- 
served stellar luminosities and colors of the monitored 
cluster galaxies, and assuming the same IMF - the "diet 
Salpeter" IMF of Bell et al. (2003). Exceptions to this 
are: Dilday et al. (2010), who converted their luminosity- 
normalized rates to mass-normalized rates assuming that 
the mean M/Lb ratio found by Sharon et al. (2007) is 
valid for their cluster sample, which spans a similar red- 
shift range; Gal- Yam et al. (2002), whose rates were mass 
normalized by Sharon et al. (2010), using M/Lb from 
Sharon et al. (2007) at low z and M/Lb from Sharon 
et al. (2010) at high z; and Graham et al. (2008), who 
estimate masses for the galaxies in their sample using the 
spectral population synthesis of Buzzoni (2005) who, in 
turn, assumed a Salpeter (1955) IMF. We therefore scale 
up the Graham et al. (2008) rate by a factor of 1.77, 
which is the ratio between the remaining stellar mass in 
an old and inactive quiescent population in a Salpeter 
(1955) IMF, and that in a diet-Salpeter IMF (see §3.4, 
below, for details). We follow Barbary et al. (2010) in 
scaling up the rate of Sharon et al. (2010) by 35%, to 
account for the offset in the M/L g vs. g — r relation of 
Bell et al. (2003), expected in the younger galaxies that 
exist at <z>= 0.6. 

2.2. Observed cluster SN la rates vs. redshift 

Figure 1 shows the rates we have compiled, and which 
we analyze in this paper. It suggests only a mild rise in 
the rates, by a factor of ~ 2 — 5, over the redshift interval 
probed. Remarkably, the cluster SN rate appears not to 
change much from z = to z = 1.45, corresponding to a 
lookback time of 9.8 Gyr. This is only about 2 Gyr after 
the formation of the cluster stars, assuming this occurred 
at a redshift Zf ~ 3. 

3. TIME-INTEGRATED SN NUMBER PER FORMED MASS 

As pointed out by Maoz & Gal- Yam (2004) and Sharon 
ct al. (2010), and now further suggested by the results 
above, the integral over the SN la rate from z = to z — 
1.45, times the mean iron yield of SNe la (0.6— O.7M ; see 
below), gives just a small fraction, roughly 10%, of the 
present-day ratio of iron mass to stellar mass observed in 
clusters. It is then unavoidable that the large majority 
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TABLE 1 

Cluster SN Ia Rates and Delay Time Distribution 

Rcdshift Cosmic time SN Ia rate Delay DTD Rcf. 

z t R Ia (t) r *(r) 

[Gyr] [SNuM] [Gyr] 

(1) (2) (3) (4) (5) (6) 



0.020±g-^ 13.2^ O.OmtolU ll-lloa 3.6±if M08 

4l« : °^ 12At% 0.060t° : ^ 10.3+H 3*±i2 D1 

^-0.090 U-6t£:2 0.098t°;EJf 8 9.515;" 5-5^: 8 7 S07 

0.225±g;°li 10.8i£;? 0.088t°;^ 8.715:1 4.9±i;f D10 

0.250l°^o l0.5l°I 0.110±g-J?g 8.4±°"* 6.2±»-g GY02 

0-460lg : ^o 8 .7l? : g 0.177« : ?12 6 . 6 +2 : 3 io.I^ 1 G08 

0.600l° : ^° 7.8l'^ 0.151±g : l?| 5.7±? : ? 8.7±j"l S10 

0.900t° : jg° 6.2±? : 2 0.220t° : ?5° 4.1±? : | 12.9±5 4 i 6 GY02 

1.120t°J30 5 .3t°* 0.364«-J0; 3.2±°-j> 21.6±i| : | BIO 

l.ltl;! 230 t JJj txt 



(1) - Visibility-time-wcigthcd mean rcdshift of cluster sample, and redshift 
range. 

(2) - Time since Big Bang corresponding to redshifts in (1). 

(3) - SN rate per unit stellar mass, in SNuM (10~ 12 SNc yr _1 M ~ 1 ), with 
normalization relative to remaining mass at the redshift of the observation. 

(4) — Time since Zf — 3 corresponding to redshifts in (1). 

(5) — Mean recovered DTD value in time bin of (4), in units of 
10~ 14 SNe yr — 1 Mq — 1 , with normalization relative to formed mass. 

(6) - References: M08 - Mannucci et al. (2008); D10 - Dilday ct al. (2010); 
S07 - Sharon ct al. (2007); GY02 - Gal- Yam ct al. (2002); G08 - Graham et 
al. (2008); S10 - Sharon ct al. (2010); B10 - Barbary ct al. (2010); txt - DTD 
value derived in this work, based on iron-mass constraints. 

Published rates have been converted to the same "diet Salpeter" IMF (Bell et 
al. 2003). S10 rate has been scaled up to account for expected evolution of the 
Bell ct al. (2003) M/L-to-color relations, according to B10. 




Fig. 1. — SN Ia rates per unit stellar mass in galaxy clusters, 
as a function of redshift, as listed in Table 1. Observed rates, 
here and in the subsequent Figs. 3—13, are in units of SNuM: 
10~ 12 SNc yr- 1 M Q - 1 . Horizontal error bars mark the cluster 
redshift intervals of the respective SN surveys, with the central 
value at the visibility-time-weighted mean redshift of each survey. 
Vertical error bars show the summed 68% confidence limit statis- 
tical and systematic uncertainties. All measurements have been 
consistently scaled to the "diet Salpeter" IMF (see text). Labels 
arc: M08 - Mannucci ct al. (2008); D10 - Dilday et al. (2010); 
S07 - Sharon ct al. (2007); GY02 - Gal- Yam et al. (2002); G08 - 
Graham et al. (2008); S10 - Sharon et al. (2010); B10 - Barbary 
ct al. (2010). 



of cluster metals were produced within the first 1-2 Gyrs 
after the formation of the cluster stars. The metals were 
produced by CC SNe and perhaps by "prompt" SNe Ia, 



but they were not produced by any SNe with a larger 
delay, as the rate measurements show that only few such 
explosions took place at those later times. 

In order to juxtapose the new cluster rates, compiled 
above, with model expectations, we revisit the question 
of the iron-to-stellar mass ratio in clusters, which dictates 
the normalization of the DTD. In addition to consider- 
ing the latest data on cluster properties, our treatment 
will differ from the one in Maoz & Gal- Yam (2004) in 
several respects, most notably in that we will use mass- 
normalized, rather than luminosity-normalized SN rates, 
and we will account for mass evolution due to stellar 
mass loss. Stellar mass is, of course, more closely related 
to the number of stars than is blue luminosity, and is a 
more stable quantity. 

The mass of iron in clusters that is attributable to 
SNe Ia is the sum of the iron masses in the ICM and 
in stars, minus the mass of iron produced by CC SNe. 
Thus, the ratio between SN la-produced iron mass and 
the present-day stellar mass is 

M Fe ,i a _ q QQ74[ -^ Fc >0 ^ M gas /M*$ Z FC;gas ^ Z Fc 



M, 



*,o 



L 0.0026 V 10 0.3 

M(> 8M )/M,, O Vcc , 



1.2 



(1) 



0.35 



0.01 J 



We discuss below each of the quantities that enter this 
equation. 

3.1. The solar iron abundance 
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The value of the photospheric solar abundance of iron, 
Z Fc ,0 = 0.0026, is from Anders and Grevesse (1989). 
Although the correct photospheric iron abundance has 
been revised down to 0.00176 (Grevesse & Sauval 1999), 
the ICM and stellar abundances we quote below, and 
which are still generally used in the literature, relate to 
the Anders and Grevesse (1989) value 5 . 

3.2. The gas-to-star mass ratio 

The present-day mass ratio of baryons in the ICM gas 
and in stars, M gas /M* , has been recently re-evaluated 
by Gonzalez, Zaritsky, & Zabludoff (2007), taking into 
account the contribution to M^o by the intergalactic 
stellar population of a cluster. Contrary to Lin et al. 
(2003), who argued for M gas /M* j0 that is independent 
of total cluster mass, Gonzalez et al. (2007) find that 
M gas /M* t o rises monotonically with cluster mass, from 
a value of about 5 for low-mass clusters to about 15 for 
the most-massive systems. To find the suitable values for 
the clusters from which were derived the SN rates that 
we analyze, we now examine the typical masses of those 
clusters. 

The most massive cluster sample in our SN rate com- 
pilation is that of Sharon et al. (2010), which is largely 
coincident with the z > 0.5 MACS cluster sample of 
Ebcling et al. (2007). The clusters in this sample have 
total masses M500 ~ 10 15 Mq, where M500 is the mass 
within a spherical volume in which the mean density is 
500 times the critical closure density. In the Sharon et 
al. (2010) sample, the part of the rest-frame B-band 
luminosity of each cluster that is included in the HST 
field of view (and which is therefore monitored for SNe) 
is (1 — 6) x 10 12 L_b,q, and the corresponding stellar mass 
range is (4-17) x 10 12 M Q , with a mean of 1O 13 M . Such 
masses also apply to the massive, X-ray-selected clusters 
at <z>= 0.25,0.90 studied by Gal- Yam et al. (2002). 
For the <z>— 0.15 clusters monitored by Gal- Yam et al. 
(2008), the typical stellar mass is 7 x 1O 12 M (Sharon et 
al. 2007). From Gonzalez et al. (2007), this corresponds 
to total masses of M 500 — 4 x 1O 14 M . 

Barbary et al. (2010) find, for their <z>~ 1.12 clus- 
ters, a mean i?-band luminosity, of « 3 x 1O 12 Lb i0 , and 
typical stellar masses of w 4 x 1O 12 M . As a check, the 
bulk properties of the Barbary et al. (2010) clusters, such 
as X-ray luminosities, X-ray temperatures, and velocity 
dispersions, also suggest masses about one-half as large 
as those of the cluster sample of Sharon et al. (2010) at 
<z>— 0.60. Well-measured X-ray temperatures exist in 
the literature for five of the Barbary et al. sample clus- 
ters, and are all around 6-7 keV (Rosati et al. 2004; Stan- 
ford et al. 2002, 2006; Boehringer et al. 2008; Gilbank 
et al. 2008). This compares to the mean temperature, 
<kT>= 9 keV, of the z > 0.5 MACS cluster sample of 
Ebeling et al. (2007). X-ray luminosities, reported for 
these five Barbary et al. clusters, and for an additional 
four of their clusters (Rosati et al. 1999; Postman et al. 
2001; Bremer et al. 2006; Andreon et al. 2008) are in 
the range ~ 0.7 — 16 x 10 44 erg s _1 , with a mean of ~ 

5 Lin et al. (2003) derived an iron-to-stellar mass ratio, using a 
fairly low ICM iron abundance of 0.21 solar found by De Grandi & 
Molendi (2001), but coupled it to the meteoritic iron abundance of 
Anders and Grevesse (1989), 0.00181. As a result, the ratio they 
found is a factor of ~ 2 lower than we find here. 
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4 x 10 44 erg s _1 , compared to <Lx>= 16 x 10 44 erg s _1 
for the Ebcling et al. (2007) sample. Galaxy velocity dis- 
persion best-fit measurements for seven of the Barbary 
et al. clusters are in the range of « 600 — 1300 km s _1 
(references above, plus Eiscnhardt et al. 2008; Cain et 
al. 2008), with a mean of about 850 km s~ , compared 
to <cr>= 1300 km s _1 for the z > 0.5 MACS clusters. 
Since total mass depends (e.g., Hicks et al. 2008) on 
X-ray temperature roughly as Ma T 1,5 , on X-ray lumi- 
nosity roughly asMa L^ 5 , and on velocity dispersion as 
M oc (j 2 , all of these observables suggest that the typical 
cluster in the Barbary et al. sample has about one-half 
the mass of the typical cluster in the Sharon et al. (2010) 
sample. Assuming a constant fraction of the cluster mass 
in stars, the stellar mass in the Barbary et al. sample 
would also be of order one-half that in the Sharon et 
al. (2010) sample. However, the stellar mass fraction in 
clusters depends on total cluster mass. For example, An- 
dreon (2010) has recently found a dependence of roughly 
/stars oc M~ ' 5 . If this relation held also for clusters at 
rcdshifts z > 1, the stellar mass in the Barbary et al. 
clusters would be only about 1.4 times lower than in the 
Sharon et al. (2010) sample, and thus more similar to 
the <z>— 0.15 clusters monitored by Gal- Yam et al. 
(2008), 

The optically selected clusters monitored for SNe by 
Graham et al. (2008) and Dilday et al. (2010) have 
masses comparable to those of the samples discussed 
above, though somewhat lower. Graham et al. (2008) 
derived a total stellar mass of M* = 8x 1O 13 M for the 
30 clusters they used for their SN rate measurement, or 
- 3 x 1O 12 M for a typical cluster. Dilday et al. (2010) 
report a total r-band luminosity of L r = 10 14 L riQ for 
the 71 clusters in their low-z, "C4", cluster subsample. 
Taking M*/L r = 3 (Sharon et al. 2007), this gives a 
mean stellar mass per cluster of <~ 4 x 1O 12 M . For 
the 492 clusters in the <z>= 0.225 "maxBCG" sam- 
ple of Dilday et al. (2010), the integrated luminosity is 
L r = 2 x lO 14 L ri0 , and hence the mean stellar mass per 
cluster is - 1 x 1O 12 M . Thus, this one subsample of 
Dilday et al. (2010) has clusters that are undermassive, 
compared to the other cluster samples we analyze. The 
other samples have only about a factor-3 difference in 
mean cluster mass between the higher- and lower-mass 
extremes. 

From the fits by Gonzalez et al. (2007), the lower 
and higher typical masses considered above (excluding 
the lower-mass maxBCG sample of Dilday et al. 2010) 
correspond to M gas /M* ; o values of 6.5 and 14.5, respec- 
tively. However, Gonzalez et al. (2007) used the relation 
obtained by Cappellari et al. (2006) between the kine- 
matically measured total-mass-to- luminosity ratio, M/L 
of nearby ellipticals and their /-band luminosity. Cap- 
pellari et al., when comparing their results to M/L esti- 
mates for the same galaxies based on spectral population 
synthesis, concluded that 30% of the contribution to their 
kinematic M/L could be due to dark matter within the 
central regions. If so, the stellar masses in clusters found 
by Gonzalez et al. (2007) would be scaled down by 0.7, 
and hence M gas /M^,fi would increase to 9.3 for, e.g., the 
<z>= 0.15 cluster sample, and to 21 for the <z>— 0.60 
cluster sample. 

Lagana et al. (2008) re-analyzed X-ray and optical 
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data for five clusters to deduce the stellar and gas mass 
fractions. To obtain stellar masses, they used the Kauff- 
mann et al. (2003) Mj L ratios, which are consistent with 
the Bell et al. (2003) M/L ratios, used in deriving the 
SN rates analyzed here. For clusters with total masses, 
M 50 o, similar to the two ends of the typical mass range 
considered here, Lagana et al. (2008) find M gas /M* ; o 
of about 8 and 17, respectively. Giodini et al. (2009) 
also recently examined the dependence of stellar and gas 
mass fractions on M 500 . Gas mass fractions were esti- 
mated for 41 clusters compiled by Pratt et al. (2009) 
from the literature. Stellar mass fractions were obtained 
for a sample of 91 groups and poor clusters from the 
COSMOS survey, based on spectral population synthe- 
sis fitting of their spectral energy distributions. These 
were combined with the re-evaluated stellar masses of 27 
clusters from Lin et al. (2003). We scale down their stel- 
lar masses by 0.7, to transform from the Salpeter (1955) 
IMF they assumed to our assumed "diet Salpeter" IMF 
(see below). For the two ends of the cluster mass range 
considered above, the best-fit relations of Giodini et al. 
(2009) then give M gas /M^fi of about 7 and 11, respec- 
tively. Finally, a recent analysis by Andreon (2010) of 52 
clusters, chosen to have accurate measurements, results 
in similar relations, with values of M gas /M* v0 of about 7 
and 13.5 for the two typical cluster masses, respectively. 

Considering all of the above studies, we see that the 
estimates of M gas /M* y o are in the range of 6 — 10 for the 
lower-mass clusters of the type monitored by most of the 
\ow-z surveys whose SN rates we analyze, 11 — 21 for 
the massive, intermediate- z clusters monitored by Gal- 
Yam et al. (2002) and by Sharon et al. (2010), and 
somewhere inside this range for the high-z cluster sample 
of Barbary et al. (2010). (We also note that these ratios 
appear to be independent of redshift; e.g. Giodini et al. 
2009). When considering the iron mass in clusters and 
the number of SNc la needed to produce it, we therefore 
choose Mgas/M^o = 10 as our "optimal" fiducial value, 
but we will also consider the consequences of a "minimal 
iron" value of M gas /M*^ = 6. 

3.3. The cluster iron adundance 

A "canonical" value of Zp c _ gas = 0.3 is by now fairly 
well-established for the ICM gas iron abundance, rela- 
tive to solar. Balestra et al. (2006), Maughan et al. 
(2008), and Anderson et al. (2009) argue for values that 
have evolved even higher, to 0.4 — 0.5, at low redshifts 
(but see Ehlert & Ulmer 2009). To err conservatively, 
we adopt Z-p eigas = 0.3 for out fiducial value. These iron 
abundances typically relate to the central ~ 1 Mpc ra- 
dius of massive clusters, the same region in which the 
stellar luminosity is measured and the SNe are detected 
by the cluster SN surveys we consider. Nevertheless, be- 
cause the X-ray emission is strongly centrally peaked, 
there is some concern that the above iron abundance 
could be strongly biased by a high abundance in the 
very core region, which might be unreprcscntatative of 
a much lower abundance existing over most of the vol- 
ume within 1 Mpc. However, Maughan et al. (2008) 
have shown that measuring iron abundances within an 
annulus that excludes the central 15% in radius reduces 
the results only mildly, to values still consistent with the 
canonical Z-p Ctgas = 0.3. For the stellar iron abundance 
relative to solar, we follow Maoz & Gal- Yam (2004) and 



Lin et al. (2003) in adopting the value i?Fc,* = 1-2 from 
the study by Jorgensen et al. (1996). 

3.4. The CC SN contribution 

M{> 8M )/M* iO is the ratio between the initial mass 
in stars of mass above 8M Q , which will explode as CC 
SNe, and the present-day mass in stars in clusters. Maoz 
& Gal- Yam (2004) examined the dependence of this ra- 
tio on the choice of IMF for a large range of standard 
and non-standard IMFs. In the present work, we will 
consistently assume the "diet Salpeter" IMF of Bell et 
al. (2003), as this is the IMF that has been assumed 
when determining rates per unit mass in the SN surveys 
that we analyze. This IMF is like the Salpeter (1955) 
single-power-law IMF with slope —2.35 between 0.1 and 
100M Q . However, when calculating observables involv- 
ing mass, such as M/L, the total initial mass in stars is 
scaled down by a factor of 0.7, to simulate the deficit, 
relative to the Salpeter IMF, of low-mass stars in real- 
istic IMFs. The ratio of mass in stars of mass above 
8M© to the total initial mass, in such an IMF, is 0.20. 
From Bruzual & Chariot (2003), for a Salpeter (1955) 
IMF, during the stellar evolution of a 10 Gyr-old pop- 
ulation 31% of the stellar mass is returned to the in- 
terstellar medium (ISM) via stellar winds and SN ex- 
plosions. For a diet Salpeter IMF, this mass-loss frac- 
tion is 0.31/0.7 = 44%, and hence the present-day ratio 
M(> 8M )/M* iO w 0.2/0.56 = 0.35. This is, in the 
present context, a conservative estimate, in that it max- 
imizes the contribution of CC SNe, relative to SNe la, to 
cluster iron production. If, as generally believed, stars 
in some mass ranges collapse directly into black holes, 
without a SN explosion and its contribution to iron pro- 
duction, then the appropriate mass ratio to be used here 
would be lower. For example, if only stars up to 50M Q 
or 25M Q explode as CC SNe, M(> 8M )/M,, O will be 
reduced to 0.29 or 0.20, respectively. 

The diet Salpeter IMF gives a very similar M(> 
8M Q )/M». ratio to that of the Kroupa (2001) and Gould 
et al. (1997) IMFs (see Maoz & Gal- Yam 2004 and Bell 
et al. 2003). In any case, since both the SN rate and 
the iron mass in clusters are normalized relative to stel- 
lar mass, as derived from observed stellar luminosity, the 
conclusions are insensitive to the assumed form of the 
IMF in the low-mass range, as long as it is assumed con- 
sistently for both. The ratio M(> 8M )/M* ;O also de- 
pends weakly on the assumed age of the population, as 
long as it is of order several Gyr or more, since most of 
the return of mass to the ISM occurs early on. 

Finally, we follow Maoz & Gal- Yam (2004) in as- 
suming that the iron yield of a CC SN, y cc , is 1% of 
the initial mass of the progenitor star. As discussed 
there, this is likely to be a conservative overestimate 
as well. For M{> 8M )/M*, O = 0.35 and the fidu- 
cial values of the metallicities, this means about 1/3 of 
the cluster iron mass was produced by CC SNc. For 
M(> 8M )/M* iO = 0.20, less than 1/5 of the iron mass 
is from CC SNe. Alternatively, for the "minimal iron" 
value of Mgas/M^o = 6, the core-collapse contribution is 
between 1/2 and 1/4. 

3.5. The SN la iron yield and correction for stellar 
mass evolution 
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Dividing Eq. 1 by the mean iron yield of SNe la gives 
the time-integrated number of SNe la that have exploded 
in clusters, per present-day unit stellar mass, A^n/M^o- 
The mean iron yield of a SN la is 0.6-0.7 M© (e.g., Maz- 
zali et al. 2007). In line with our conservative approach, 
we will assume the higher value of 0.7 M©, thus lowering 
the integrated number of SNe la required to explain the 
observed abundances. 

The time-integrated number of SNe la per unit formed 
stellar mass, before the 44% mass loss that occurs over 
the course of ~ 10 Gyr (see §3.4, above), will be 0.56 
times lower. Multiplying Eq. 1 by 0.56/0.7M©, we thus 
obtain the time-integrated number of cluster SNe la per 
unit formed stellar mass, 

( ^Tjr J = 0.0059 M©- 1 = 5.9 SNuM Gyr, (2) 

V M * J opt 

assuming the "optimal" value of M gas /M„ = 10, and 
the other fiducial values in Eq. 1. Alternatively, for the 
"minimal" value of M gas /M* ; o = 6, we obtain 

( = °' 0034 M ® _1 = 3 ' 4 SNuM Gyr ' ^ 

V * / min 

4. A RECONSTRUCTION OF THE OBSERVATIONAL SN 
DELAY-TIME DISTRIBUTION 

In this section, we combine the observed cluster SN 
rates, Ri a {t) (from §2) and the integrated SN la number 
per formed stellar mass, Nsn/M* (from §3), to recover 
the SN la DTD. In the next section, we take an alterna- 
tive, forward-modeling, approach, to examine the predic- 
tions of a variety of specific model DTDs. Throughout 
this section, we assume that all stars in clusters formed in 
a single instantaneous burst, neglecting any subsequent 
star formation. As already noted above, this is a fair 
approximation to the results of spectral synthesis of the 
mass-dominant stellar populations in cluster ellipticals, 
indicating a burst lasting ~ 0.1 Gyr at Zf « 3 (corre- 
sponding to cosmic time tf w 2 Gyr), i.e., some 11.5 Gyr 
ago. The consequences of relaxing the instantaneous- 
burst assumption are studied in §6-7, below. If, in fact, 
the burst assumption is valid, recovery of the DTD is 
straightforward. The SN rate since the epoch of star 
formation differs from the DTD only in the stellar-mass 
normalization - the DTD is the SN rate normalized by 
unit stellar mass at the formation epoch, M*, while the 
SN rate is normalized by the remaining stellar mass, 
M*(£) = M* m(t), at the cosmic epoch of the rate mea- 
surement. Here, m(t) is the remaining fraction of the 
initially formed stellar mass at cosmic time t (since the 
Big Bang). The observed SN rates per unit mass at cos- 
mic times t are the values of the DTD at delays t — tf, 
up to the correction, easily applied, to convert from the 
existing mass at time t to the formed mass at tf, ac- 
counting for mass loss during stellar evolution. The SN 
rate and the DTD are thus related by 



Rla(t) 



"tit - tf) 

m(t-t f )' 



(4) 



Bruzual & Chariot (2003) tabulate the relative accu- 
mulated stellar mass loss versus time, m\ oss (t), following 
a burst of star formation, assuming a Salpeter (1955) 
IMF. For the "diet Salpeter" IMF, the remaining mass 



fraction is m(t) = l — m\ oss (t)/0.7. Starting 2.5 Myr after 
star formation, the remaining mass m(t) is well approx- 
imated as (i/2.5 Myr)-°- 07 . The predicted SN rate will 
therefore decline slightly less steeply than the DTD. We 
convert the observed SN rates, i?/ a , to the DTD, ^(t), 
using Eq. 4 with the full Bruzual & Chariot (2003) mass 
loss (rather than the power- law approximation). 

To estimate the DTD at delay times earlier than those 
corresponding to the redshift, 2; max , of the most distant 
clusters monitored, we can use the Nsn/M* constraints 
from the iron observations. The total number of SNe 
per unit formed stellar mass between Zf and z max is 
(A SN /M«) z/ , Zmax = (A SN /M») - (A SN /M*) Zmax>0 , where 

(-^ r SN/A^*) Zmax>0 is the total number between z — and 
z max , obtained by integrating the DTD over that range. 
The mean DTD value in the time interval [0,i max — tf], 
corresponding to the unobserved redshift range Zf to 
is just {N SN /M*) ZftZin&x / (i max - t f ). 

To estimate (AsN/M»)z ma x,o> we integrate, using small 
time steps, over the mass-loss-corrected SN rates. At 
times that are covered by several measurements, we take 
a mean rate, weighted by the relative measurement er- 
rors. Out to z = 1.45, we find an integrated SN to stellar 
mass ratio of (A SN /M») Zmax = 8.2 x 10~ 4 M©" 1 . Sub- 
tracting this from the minimal-iron value of 0.0034 M© -1 
(Eq. 3) leaves 0.0026 M©" 1 in the time interval of 2.2 Gyr 
between Zf = 3 to z max = 1.45, or a mean DTD value 
of 0.012 SNe yr- 1 (lO lo M )- 1 in this delay time interval 
of — 2.2 Gyr. For the optimal-iron value (Eq. 2) of 
0.0059 M©- 1 , the mean DTD value in this time bin is 
0.023 SNe yr-^lO^M©)" 1 . We take this range of DTD 
values as the uncertainty, which is driven in this case 
mainly by the systematic uncertainty in the cluster gas- 
to-stellar mass ratio. For our best estimate we take the 
optimal-iron value. We note that this exercise implies 
that, for the minimal and optimal iron assumptions, re- 
spectively, 79% to 88% of the SNe la in clusters exploded 
before z — 1.45. 

Figure 2 shows the recovered DTD, whose values are 
also listed in Table 1. The horizontal error bars mark 
the limits of each DTD time bin, while the vertical error 
bars show the summed (not in quadrature) and prop- 
agated statistical and sytematic errors. Also shown in 
Fig. 2 are three other recently recovered observational 
SN la DTDs. One (empty circles) is from Maoz et al. 
(2010), obtained for a subsample of the local galaxies in 
the Lick Observatory SN Search having individual SFH 
reconstructions based on SDSS spectra. Wc have plot- 
ted here the Maoz et al. (2010) DTD from a subsample 
that excluded galaxies of Hubble types Sa to Sbc, in or- 
der to reduce cross talk between time bins. Due to the 
limited aperture of the SDSS spectrograph fibers, there 
remains a "leak" of signal from the first to the second 
time bins, and therefore the plotted DTD level in the 
40 — 420 Myr time bin is likely an underestimate of the 
true level (see Maoz et al. 2010). Another DTD shown 
(empty triangles) was found by Maoz & Badenes (2010), 
using the SN remnants in the Magellanic Clouds as an 
effective SN survey. In this DTD, there is a detection in 
one time bin, of t < 330 Myr, but only an upper limit 
on the DTD value between 330 Myr to a Hubble time, 
which is indicated. Finally, we show (empty squares) the 
DTD recovered by Totani et al. (2008), using SN la can- 
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Fig. 2. — Recovered SN la delay-time distribution (DTD) for 
galaxy clusters (filled circles). All but the earliest point are ob- 
tained directly from the observed rates in Fig. 1, after correcting 
them for stellar mass loss, and assuming an instantaneous star- 
formation burst at Zf = 3 for all clusters. Horizontal error bars 
indicate the time bin over which each mean DTD value is evalu- 
ated. The first point is obtained by requiring a time-integrated 
number of SNe la according to Eqns. 2-3, satisfying the observed 
iron-to-stellar mass ratio in clusters. Also plotted, for comparison, 
arc the recovered observational DTDs from: Maoz et al. (2010), 
based on SNe in nearby galaxies from LOSS (empty circles); Maoz 
& Badcncs (2010), based on SN remnants in the Magellanic Clouds 
(empty triangles, with arrow marking the 95% confidence upper 
limit); and Totani et al. (2008) based on SN la candidates in ellip- 
tical galaxies at z = 0.4 — 1.2 (empty squares). For reference, we 
also show (curves) power laws, t~ a , with s = —1.1 and s = —1.3, 
scaled to pass through the latest cluster-based point. 

didates in SXDS field elliptical galaxies at z = 0.4 — 1.2. 
We have divided the DTD values of Totani et al. by a 
factor 1.8, given by them, to convert from their normal- 
ization unit, of present-day if-band luminosity, to ours, 
of formed stellar mass. We have then divided by a fur- 
ther factor 0.7 to convert from their assumed Salpctcr 
(1955) IMF to our diet-Salpeter IMF. We also show, for 
reference, two power laws, oc t -1-1 and $ oc t -1 ' 3 , 
which are discussed further in § 5.1, below. 

Several facts are apparent in Fig. 2. First, consider- 
ing the different time bins of the different measurements, 
and the known systematics, the various measurements 
are generally consistent with each other in regions of 
overlap. Second, the DTD is a monotonically decreas- 
ing function, peaking at the earliest delays probed by 
the observations. Finally all the plotted points appear 
to be generally consistent with the illustrated power-law 
dependences, which, remarkably, pass through each and 
every error box. This consistency is tested quantitatively 
in § 5.1. With this new DTD derivation, combined with 
those by Maoz et al. (2010) and Maoz & Badencs (2010), 
we thus confirm and extend the results of Totani et al. 
(2008), who found in their field elliptical SN sample a 
best-fit DTD dependence of a power law, oc t s , with 
s = 1.08 ± 0.15. This is also further discussed below, in 
§5.1. 

5. COMPARISON TO DTD MODELS - 
SINGLE-COMPONENT MODELS 

In this section, we take a forward-modeling approach, 
to compare the predictions of specific model DTDs to the 
observations. Foward modeling is advantageous in that 
the actual data are not manipulated and errors need not 



be propagated. Furthermore, this approach will permit 
us later (§6-7) to relax the assumption of a single short 
burst of star formation in clusters. 

A model DTD that we test, &(t), in order to be con- 
sistent with observed cluster iron abundances, needs to 
have a normalization such that its integral over a cluster 
stellar age, to, agrees with the above, time- integrated, 
SN numbers: 

jf .(.)*- 3*. (5) 

This is one constraint on the DTD, imposed by the clus- 
ter iron abundances. A second set of constraints is im- 
posed by the observed time dependence of the SN rate, 
which can be compared to the model DTD predictions. 
The predictions will be tested with the \ 2 figure of merit. 
The SN rate at each visibility-time-weighted mean red- 
shift is compared with the model prediction, with the 
latter averaged over the redshift interval spanned by the 
cluster sample used for deriving each SN rate. Errors on 
the observed rates are generally asymmetric. For the \ 2 
calculation, we use the error that is on the model side of 
each data point. 

Throughout this section, we assume that all stars in 
clusters formed in a single instantaneous burst, neglect- 
ing any subsequent star formation. As already noted 
above, this is a fair approximation to the results of spec- 
tral synthesis of the mass-dominant stellar populations 
in cluster ellipticals, indicating a short burst at zj « 3. 
As noted in §4, the SN rate since the epoch of star for- 
mation differs from the DTD only in the stellar-mass 
normalization, whether formed mass, M*, or remaining 
stellar mass, M*(t) = M* m{t), at the epoch of the rate 
measurement, and we again use Eq. 4, this time to con- 
vert the DTD model, &(t), to a rate prediction, Ri a (t). 

Various forms have been proposed for the DTD, some 
derived from detailed binary population synthesis calcu- 
lations (e.g., Yungelson & Livio 2000; Han & Podsiad- 
lowski 2004; Ruiter et al. 2009; Bogomazov & Tutukov 
et al. 2009; Mennekens et al. 2010); some physically 
motivated mathematical parameterizations, with vary- 
ing degrees of sophistication (e.g., Madau et al. 1998; 
Greggio 2005; Totani et al. 2008); and some ad hoc for- 
mulations intended to reproduce the observed field SN 
rate evolution (e.g., Strolger et al. 2004). We now at- 
tempt to fit the observed cluster SN la rates with some of 
these proposed DTDs, using Eq. 4, while simultaneously 
satisfying the DTD normalization constraint, Eq. 5, set 
by the iron abundances in clusters. 

5.1. Power-law DTDs 

A first and simple mathematical parametrization of the 
DTD that we compare to the observed cluster SN rates 
is a power law in time, ^f(t) oc t s . Power laws have 
been long considered as possible forms of the DTD (e.g., 
Ciotti et al. 1991; Sadat et al. 1998). As noted by pre- 
vious authors (e.g., Greggio 2005; Totani et al. 2008) a 
power-law dependence is generic to models (such as the 
DD model) in which the event rate ultimately depends 
on the loss of energy and angular momentum to grav- 
itational radiation by the progenitor binary system. If 
the dynamics are controlled solely by gravitational wave 
losses, the time t until a merger depends on the binary 
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separation a as 



t 



If the separations are distributed as a power law 
dN_ 
da 



then the event rate will be 



(W _ dJVda (e 
dt da dt 



-3)/4 



(6) 



(7) 



(8) 



For a fairly large range around e w — 1, which describes 
well the observed distribution of initial separations of 
non-interacting binaries (see Maoz 2008 for a review of 
the issue in the present context), the DTD will have a 
power- law dependence with index not far from —1. How- 
ever, in reality, the binary separation distribution of WDs 
that have emerged from their common envelope phase 
could be radically different, given the complexity of the 
physics of that phase. Thus, the ~ t~ x DTD dependence 
of the DD channel cannot be considered unavoidable. Be 
that as it may, the DTD reconstructions by Totani et al. 
(2008), Maoz et al. (2010), Maoz & Badenes (2010), and 
in this work (§4), all point to a ~ t^ 1 power-law. 

A different power-law DTD dependence, with different 
physical motivation, has been proposed by Pritchet et 
al. (2008). If the time between formation of a WD and 
its explosion as a SN la is always brief compared to the 
formation time of the WD, the DTD will simply be pro- 
portional to the formation rate of WDs. Assuming that 
the main-sequence lifetime of a star depends on its initial 
mass, m, as a power law, 



t ~ m° 



and assuming the IMF is also a power 



law, 



dN 
dm 



~ ra 



(9) 



(10) 



then the WD formation rate, and hence the DTD, will 
be 

dN = dN_dm ^ (i +A -5)/« (n] 
dt dm dt ' ' 

For the commonly used value of S = —2.5 and the 
Salpeter (1955) slope of A = —2.35, the resulting power- 
law index is —0.46, or roughly —1/2. Pritchet et al. 
(2008) have argued that such a t" 1 / 2 form for the DTD 
can explain the trend of SN la rate versus specific star- 
formation rate of the host population in the Supernova 
Legacy Survey. 

We test model power-law DTDs with these two par- 
ticular power-law indices, —1, and —1/2, as well as a 
continuous range of indices, s, against the cluster data. 
We assume *&(t) = for t < 40 Myr, corresponding to 
the lifetime of 8M© stars (e.g, Girardi et al. 2000), at 
the border between CC and WD formation. We start by 
setting the star- formation redshift at Zf = 3, i.e., at a 
cosmic time tf = 2.1 Gyr. 

Fitting the observed cluster SN rates with a power law 
DTD having a free normalization and index s, the best 
fit is obtained for s = —1.2. The time-integrated SN 
number is Nsn/M* = 0.0044 for a power law of this 
slope, which is intermediate to the "minimal iron" value 
we have derived above (Eq. 3) and the "optimal" value 
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Fig. 3. — Fits of the predictions of 't oc t -1 / 2 power-law DTDs 
to the observed cluster SN la rates, plotted as a function of time 
since the Big Bang, assuming instantaneous cluster star formation 
at Zf = 3, and SN la events starting 40 Myr after star forma- 
tion. The best-fitting version (dotted curve) has a time-integrated 
number of SNe that is only a minor fraction of that required to pro- 
duce observed cluster iron-to-stellar mass ratios. Versions scaled 
to produce the minimal (dashed curve) or optimal (solid curve) 
integrated number of SNe give poor fits to the cluster rates, with 
the reduced \ 2 values indicated. 



(Eq. 2). However, with the normalization thus uncon- 
strained, x 2 depends weakly on s, and there is a wide 
range of indices, —1.6 < s < 2.7, that give acceptable 
fits to the observed rates. (We will deem as "acceptable" 
those model with xl < 2, which corresponds to a prob- 
ability of > 5% for 7 d.o.f. - 9 data points minus two 
free parameters, s and the normalization.) The steeper 
ones among these power laws also have sufficiently large 
integrals. Specifically, power laws with s < —0.88 satisfy 
the minimal iron constraint, while shallower power laws, 
e.g., s — —0.5, do not. 

Conversely, ans= —0.5 power-law normalized to have 
the correct integral, when compared to the data, grossly 
overpredicts the observed rates, giving an unacceptably 
high reduced chi-square, Xr — X 2 /d.o.f., of 17 and 69 per 
degree of freedom (d.o.f.), for the minimal and optimal 
normalizations, respectively. (In the latter two fits, there 
are no free parameters that are adjusted to fit the data, 
and hence we have 9 degrees of freedom, as the number 
of independent data points.) Figure 3 shows these three 
fits. Thus, a t -1 / 2 power law, while it can describe well 
the time dependence of the cluster rates at < z < 
1.45, cannot simultaneously produce the required time- 
integrated SN la numbers. 

If we force the minimal- iron constraint, the best- fit 
power law is t -1 ' 1 , with an acceptable Xr = 0.11, as 
shown in Fig. 4. The low x 2 value indicates that the er- 
rors on at least some of the rates have been conservatively 
overestimated. The combined constraints of the SN rate 
data and the minimal iron abundances limit the accept- 
able values of the power law index to s = — 1.10lo' 2 2- 
For the optimal iron constraint, the acceptable range of 



indices is s = —1.28 



+0.25 
-0.18J 



with x 2 = 0.10 for the best fit. 



Smaller, more realistic, rate uncertainties would reduce 
these allowed ranges. 

We thus see that a simple parametrized model DTD of 
the form t , or slightly steeper, can provide a good fit 
to the cluster SN la rates while simultaneously providing 
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Fig. 4. — Same as Fig 3, but for steeper power law DTDs, 
\t oc t~ s , with s = —1.1 and the minimum-iron normalization 
(dashed curve), and s = —1.3 plus the optimal-iron normalization 
(solid curve). In both cases, there is a good fit to the cluster SN 
rate data. 

a sufficient time-integrated number of SNe la to satisfy 
the iron-based constraints. 

We note that these conclusions depend weakly on the 
chosen epoch of cluster star formation, t f . For a given 
power-law slope, acceptable fits are obtained for tf in 
the range 1.2 to 2.3 Gyr, corresponding to Zf = 4.8 to 
2.8, with a best fit that has a constant minimum x 2 in 
the range Zf = 3.2 — 3.7. Varying tf cannot salvage 
power-law DTDs such as i -1 / 2 , that are strongly ruled 
out because of the shallowness of their slope. However, 
moving tf back can slightly increase the allowed range 
of slopes. For example, for the optimal iron value, the 
maximal acceptable power-law slope can be raised from 
— 1.11 to —1.08 by shifting Zf back to 3.6. 

It is arguable that, instead of a single, ~ i" 1 power 
law, motivated by binary mergers, with this power law 
extending back to delays as short as 40 Myr, there could 
be a "bottleneck" in the supply of progenitor systems 
below some delay. Such a bottleneck could be due 
to the birth rate of WDs, which behaves as ~ t -1 / 2 . 
One possible result would then be a broken-power-law 
DTD, with oc t -1 / 2 up to some characteristic time, 
t c , and VP oc t^ 1 thereafter. A possible value could be 
t c w 400 Myr, corresponding to the lifetimes of 3M 
stars. If that were the lowest initial mass of stars that 
can produce the WD primary in a DD SN la progenitor, 
then beyond t c the supply of new systems would go to 
zero, and the SN la rate would be dictated by the merger 
rate. 

We have therefore attempted to fit the data with such 
a i -1 / 2 , t^ 1 broken power law. With the DTD nor- 
malization fixed to produce the required mimimum iron 
integrated SN numbers, this model gives an acceptable 
X 2 = 1-2, though this is considerably worse than the sin- 
gle power-law fit, and could become unacceptable with 
more realistic errors. We test the optimal iron value 
with a £ -1 / 2 , t -1-3 broken power law, that is steeper 
at late times (since even t^ 1 alone is already rejected, 
see above). Here, too, an acceptable x 2 — 0.9 is found. 
For the minimal-iron normalization, a t -1 / 2 , t -1-1 depen- 
dence is acceptable, as long as t c < 1.5 Gyr. Such a late 
break time is interesting in the context of sub- Chandra 



merger models, in which the mergers of white dwarfs of 
initial masses smaller than 3M produce SNe la (Sim et 
al. 2010; Van Kerkwijk et al. 2010). 

To summarize, the cluster SN rates plus iron abun- 
dances can be fit with a power- law DTD, under some 
conditions. Assuming the minimal iron mass value in- 
dicated by cluster observations, a range of slopes with 
values of s « —1, or somewhat steeper, is allowed. A 
break to a shallower t -1 / 2 dependence at t < t c is also 
permitted, provided the power-law is steep enough at 
longer delays, and the break docs not occur too late. For 
the acceptable range of power-law DTDs, between 50% 
and 85% of SNe la explode during the first Gyr after star 
formation. 

5.2. DTDs from binary population synthesis models 

Over the past decade, a number of groups (e.g., Yun- 
gelson & Livio 2000; Han & Podsiadlowski 2004; Bcl- 
czynski et al. 2008; Ruiter et al. 2009; Bogomazov & 
Tutukov 2009; Wang, Li, & Han 2010; Mennekens et 
al. 2010) have calculated DTDs using binary popula- 
tion synthesis (BPS), in which a large number of bina- 
ries with a chosen distribution of initial conditions are 
followed through the stages of stellar and binary evolu- 
tion, to the point that some of them reach the conditions 
for explosion as SNe la. In our comparison to cluster 
SN rates, we focus here, as examples, on the models pre- 
sented by Yungelson & Livio (2000), and on the more 
recent ones by Mennekens et al. (2010). As opposed to 
the simple parametrized models, considered above, the 
BPS models can make absolute predictions of SN rates 
vs. time, i.e., their normalizations are set. Therefore, in 
our comparisons of these models to the observed rates 
and to the integrated iron mass in clusters, we first con- 
sider the "raw" predictions of the models, without any 
scalings, and then proceed to test scaled versions. 

Yungelson & Livio (2000) studied four different evo- 
lutionary paths to a SN la: a DD model; an SD model 
with accretion of He from a giant companion and deto- 
nation at sub-Chandrasekhar mass, through an edge-lit 
detonation caused by ignition of the He layer (He-ELD); 
and SD models with accretion from a sub-giant com- 
panion and detonation at the Chandrasekhar mass (SG- 
Ch), or through an edge- lit detonation (SG-ELD). The 
DTDs for these different paths can be seen in their fig. 
2. We have scaled up these DTDs by a small factor of 
1.05, to convert from the IMF assumed by Yungelson & 
Livio (2000) to our adopted diet Salpctcr IMF. Their as- 
sumed IMF is a broken power law, of index A = —2.5 
from 0.3M© to 100M Q , and with A = from 0.08M Q 
to O.3M (L. Yungelson, private communication). (For 
a fully self-consistent comparison of the shapes of the 
DTDs, their models would need to be re-calculated with 
our adopted IMF, but it is unlikely that the different 
IMF slopes, —2.5 and —2.35, over the limited range of 
masses that contribute to SN la progenitors, would lead 
to major changes in the DTD). Two of the models, He- 
ELD and SG-Ch, predict no SNe beyond 1.5 - 2 Gyr 
after star formation, where all the measured cluster SN 
rates are. These models are obviously inconsistent with 
the observed rates, although they could play a role in a 
multi-component DTD scenario (see §7, below). The DD 
and SG-ELD models, on the other hand, do predict SN 
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Fig. 5. — Comparison of the observed cluster SN rates to the 
predictions of the DD model from the BPS simulations of Yungcl- 
son & Livio (2000). The "raw" prediction of the model (dashed 
curve), without any scaling, underprcdict both the observed rates 
and the required integrated number by a factor of 5. A version 
of the model (solid line) scaled to produce the minimal-iron nor- 
malization matches well the observed rates. When scaled to the 
optimal-iron value, however, it overpredicts the low-redshift cluster 
rates. 

events on long time scales, and can be tested. 

Figure 5 compares the predictions of the DD model of 
Yungclson & Livio (2000) to the observations. The "raw" 
version of this model, without any re-scaling of the DTD 
(beyond the adjustment above for conversion between 
the IMFs), underpredicts the observed cluster SN rates, 
with a poor Xr = 2.9. Its integrated number of SNe per 
formed stellar mass is TVsn/jW* = 0.0007, just 1/5 of the 
minimal-iron value, and only 12% of the optimal value. 
This confirms previous assertions by Maoz (2008), Ruiter 
et al. (2008), and Mennekens et al. (2010), that BPS 
models underpredict observed SN rates by at a factor of 
at least a few, and likely by more. As seen in Fig. 5, if 
we scale up the model by a multiplicative factor of 5, so 
as to integrate to the minimal-iron value, we obtain an 
acceptable Xr = 0-3 for the rates. Although BPS models 
have many free parameters, it is not clear that such a 
level of scaling-up of the model could be achieved easily. 
If we force the DTD integral to the optimal-iron value, 
the predicted rates are too high, giving an unacceptable 
Xl = 3.0. 

Proceeding to the SG-ELD model of Yungelson & Livio 
(2000), this model (Fig. 6) always gives a poor fit to 
the SN rates, whether in its raw form (which again pro- 
duces only 23% of the minimum iron value), or scaled to 
satisfy the iron constraints, or even if scaled arbitrarily. 
The SG-ELD model begins making SNe la only about 
800 Myr after star formation, and thus misses the op- 
portunity of producing the bulk of the iron mass during 
that time. Furthermore, this model then predicts SN 
rates that fall too steeply with time. These problems 
cannot be alleviated by a change in the star-formation 
epoch, tf, which has little effect on the fits, as was the 
case for parametrized power-law DTDs. 

We turn now to the BPS models of Mennekens et 
al. (2010), who have examined both DD and SD mod- 
els. They have assumed a Kroupa et al. (1993) IMF, 
consisting of a three-part broken power law, with index 
A = -1.3 from 0.08M Q to O.5M , A = -2.2 from O.5M 
to 1M Q , and A = -2.7 from 1M Q to 100M Q . The mass 
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Fig. 6. — Same as Fig. 5, but for the SD model of Yungelson & 
Livio (2000) involving a sub-giant donor and an edge-lit detonation. 
The cluster epoch of star formation, as before, is assumed to be 
Zf = 3, but in this model the first SN la events are delayed by 
~ 800 Myr. Because of the steep decline of this model DTD, 
neither the raw models, nor those scaled to the normalizations 
required by iron abundances, match the rate observations. 

ratio between this IMF and the diet-Salpeter IMF is 0.92, 
and we therefore scale down the Mennekens et al. (2010) 
DTDs by this factor. 

In their DD models that we examine, Mennekens et 
al. (2010) introduce the possibility of non-conservative 
Roche-lobe overflow, which they parametrize with a (3 
parameter, the fraction of material lost by the donor 
star that is accepted by the accreting star. Values of 
(3 = 0,0.8,0.9, and 1.0 are considered. The a parame- 
ter (Webbink 1984), which is the fraction of the orbital 
energy lost during the common-envelope phase that is 
transferred to kinetic energy of the envelope, is set to 1. 
An additional DD model is calculated with (3 = 1, but 
treating the common-envelope phase, instead of with the 
a model, by using the 7 parametrization of Nelemans & 
Tout (2005), with 7 = 1.5. The 7 approach quantifies 
the change in angular momentum during the common 
envelope phase. Two SD models with (3 = 1 have also 
been calculated by Mennekens et al. (2010), one with 
a = 1.0 and one with 7 = 1.5. 

Fitting these models to the cluster rates, with or with- 
out the cluster iron constraints, we find the following. 
None of the raw DTDs of Mennekens et al. (2010) pro- 
duce enough time-integrated SN numbers to reproduce 
the observed iron abundances. At best, the (3 = 0.8 and 
(3 = 0.9 DD models produce 12% of the minimal-iron 
value, and the SD a = 1 model makes 16%. The other 
models make only a few percent or less of the minimal 
number of SNe required by cluster abundances. If we 
scale up the models, forcing the minimal-iron normal- 
ization, the Mennekens et al. (2010) SD models, qual- 
itatively like the Yungelson & Livio (2000) SD models 
discussed above, predict no SNe 4 — 5 Gyr after star for- 
mation, and hence cannot match the observed cluster 
rates. This is illustrated in Fig. 7. Two of the scaled 
Mennekens et al. (2010) DD models give acceptable fits 
to the observed cluster SN rates: DD with a = 1 and 
[3 = 0.9; and DD with 7 = 1.5 and (3 = 1. These fits are 
shown in Fig. 8. 

To summarize our analysis of BPS models from these 
two teams, the emerging picture is that SD models al- 
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Fig. 7. — Rate predictions based on DTDs from two SD models 
by Mcnnckens ct al. (2010), based on two different parametriza- 
tions of the physics of the common-envelope phase (see text). The 
models have been scaled up by factors of 80 (7 = 1.5 model, 
solid curve) and 6.4 (a = 1.0 model, dashed curve), to match 
the minimal-iron constraint but both fall too steeply to match the 
observed cluster SN rate redshift dependence. 
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Fig. 8. — Same as Fig. 5, but for two of the DD models from 
the BPS calculations of Mennekens et al. (2010) that can match 
the data, if the raw DTDs are suitably scaled up according to the 
optimal iron requirement, or the minimal requirement, as marked. 
The parameter /3 quantifies the degree of mass conservation during 
the Roche-lobe overflow phase. 

ways fit the observations poorly - in terms of both the 
absolute "raw" numbers of SNe they predict, and the 
time dependence of the cluster SN rates they predict. 
The raw DD models also underpredict the absolute SN 
numbers, but by lower factors, of 5 — 8. If we treat the 
BPS models as scalable, then for some of the DD models 
it is possible to simultaneously satisfy the minimal-iron 
constraints and the observed cluster SN rate dependence 
on redshift. The higher, optimal-iron, constraint can also 
be satisfied by two scaled DD models among those of 
Mennekens et al. (2010). 

5.3. DTDs from analytical models 

Another approach to making DTD predictions, fol- 
lowed by Greggio (2005), is to calculate analytical DTD 
models, based on stellar evolution arguments and on var- 
ious parametrizations of the possible results of the com- 
plex common-envelope phases through which SN la pro- 
genitor systems must pass. For each of several SN la 



Redshift 





54 Z 


3 1.5 


1 


0.6 







_ 1 1 | 1 


1 1 1 ' 


1 ■ ■ 


1 1 1 : 

G2005 models 


10 


— 




— 


DD 


-close 3, (N/M.)opt - 






\ v 




t- 1/2 ,t-", (N/M.)„ pl ; 










DD 


-close 2, (N/M,) mln ; 


1 








SD, (N/M.),m. 














0.1 
















1 , 1 






1 , , " 



5 10 

Cosmic Time [Gyr] 



Fig. 9. — Comparison of the observed cluster SN rates to predic- 
tions by three of the analytic models of Greggio (2005), scaled to 
give either the minimal or optimal ratio of integrated SN numbers 
to stellar mass. The SD model, like the SD models from the BPS 
simulations, underpredicts the low-2 cluster SN rate. Two of the 
DD models, however, can match the data. Also shown, for compar- 
ison (dotted line), is the prediction from a simple broken-power- 
law DTD: t- 1 / 2 at 40 Myr< t < 400 Myr; t -1 - 3 for t > 400 Myr, 
having the minimal-iron normalization, which is similar to the DD- 
close-3 model (dashed curve). 

channels, she calculated the DTDs that emerge when 
varying the values for the parameters describing the ini- 
tial conditions, and the mass and separation distributions 
and limits of the systems that eventually explode. Men- 
nekens et al. (2010) have criticized this approach in that 
it overlooks the changes in stellar-evolution timescales 
that occur as a result of the "rejuvenation" due to mass 
transfer between stars. On the other hand, an advantage 
of the analytic approach, compared to the BPS approach, 
is that predictions for a large range of parameters can be 
made quickly. This successful range can then be investi- 
gated in more detail with actual BPS simulations. 

We focus on a selection of representative models shown 
in fig. 1 of Greggio et al. (2008). These include one SD 
model and four DD models, computed under different as- 
sumptions for these parameters. The "wide" and "close" 
labels of the DD models refer to two possible parametric 
schemes used by Greggio (2005) to describe the WD sep- 
aration distribution after the common envelope phase. 
The Greggio models do not predict the absolute levels of 
the SN rates (i.e., the normalization of the model DTDs), 
as this is another free parameter in the models. We there- 
fore examine only various scaled versions of the Greggio 
models. 

The DD-close model with a minimum secondary ini- 
tial mass of 3M fits the observed SN rate redshift de- 
pendence, while satisfying either the minimal iron con- 
straint (with \r — 0.52) or the optimal iron constraint 
(with Xr = 0.45). Maoz et al. (2010) have already noted 
the good agreement between this Greggio (2005) model 
and a DTD reconstructed from the SNe in the LOSS- 
SDSS subsample, and the fact that this Greggio model 
is similar to a £ -1 / 2 ,t -1 ' 3 , broken power-law with break 
at t c < 400 Myr. Such a broken power law was shown 
to fit well the data and the minimal-iron constraint in 
§5.1, above. The resemblance is seen in Fig. 9. The DD- 
close model with a minimum secondary mass of 2M Q , 
also shown in Fig. 9, can fit the rates as well, but only 
with the minimal-iron constraint, due to its shallower 
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slopes at both early and late times. The two DD-wide 
models of Greggio (2005) we test, with minimal masses 
of 2M Q and 2.5M , are both too shallow to fit the rates 
and either of the normalizations, and give Xr > 3.6. 

Finally, the SD model of Greggio (2005), while having 
a DTD that is more extended in time than the SD mod- 
els from BPS that we have examined above, suffers from 
similar problems, particularly a steep drop in rates be- 
yond ~ 8 Gyr. The minimal-iron normalized SD model, 
shown in Fig. 9, is formally acceptable, with x\ < 2, but 
its low predicted rates at low redshifts are in conflict with 
the measurements. (Poisson statistics would be more ap- 
propriate to compare the low prediction to the accurately 
measured, non-zero, rates at low redshift.) 

Thus, only the two DD-close Greggio (2005) models 
are consistent with the rates and the iron data. This is 
simply the result of the fact that DD models generically 
produce power-law DTDs, and the power-laws required 
to fit simultaneously the observed rates and the normal- 
izations need to have indices of w —1 or steeper. Such 
indices can be obtained by emerging from the common- 
envelope phase with a steep power-law separation distri- 
bution, i.e., with relatively more close pairs, as discussed 
in §5.1 and seen in Eq. 8. Naturally, we have examined a 
limited range of the Greggio (2005) models, and it would 
be interesting to see if there are others (e.g. Greggio 
2010) that do fit the data, and what are their parame- 
ters. 

6. COMPARISON TO DTD PREDICTIONS - 
NON-INSTANTANEOUS CLUSTER STAR-FORMATION 
HISTORIES 

So far, we have assumed the cluster SFH to be a single, 
instantaneous burst at zj . We now test whether relaxing 
this assumption can improve the fit of any of the mod- 
els to the data, in terms of reproducing both the time 
dependence of the SN la rate and the normalization, as 
required by the iron abundance. Furthermore, a non- 
instantaneous burst may be a more realistic description 
of cluster SFH. The predicted SN rate will now be a con- 
volution of the SFH, S(t) 1 with the DTD, after correction 
for stellar mass loss, 
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We consider a single, but non-instantaneous, burst of 
star formation in galaxy clusters in the form of an ex- 
ponentially decaying SFH, S(t) oc cxp[— (t — £/)/tsf]) 
starting at time tf. More complex SFHs are considered 
in §7.2, below. We have re-fit the cluster rates with the 
iron-mass normalizations using all the DTD models dis- 
cussed in §5, above, but convolved with this exponential 
SFH, with values of r SF = 0.5, 1, 2, 3, 4 Gyr. The effect on 
the SN rate of the convolution between the DTD and any 
temporally extended star formation is always to transfer 
some fraction of the SN events to later times. Since all 
the DTDs that we consider peak at short delays, this 
means that the SN rate rises more slowly at early times 
than in the instantaneous burst approximation. This, in 
turn, means that a smaller fraction of the time-integrated 
SN number, dictated by the iron abundances, can be pro- 
duced at early times. Raising the model normalization, 
such that the iron constraints are met, then results in a 
poor fit to the SN rate data, compared to the instanta- 
neous case. To try to mitigate this effect, we shift back by 
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Fig. 10. — Comparison of the data to predictions of a t~ 11 
power-law DTD, convolved with an exponential SFH starting at 
Zf = 5, and several characteristic exponential times r s f. The pre- 
diction for the same DTD, but following an instantaneous burst 
at Zf = 3, shown before in Fig. 4, is plotted for comparison (dot- 
ted curve). All curves are normalized to satisfy the minimal- iron 
constraint. Such extended SFHs degrade the agreement with ob- 
servations, and r 3 f < 2.7 Gyr is required to avoid significantly 
overpredicting the rates. 
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Fig. 11.— Same as Fig. 10, but for the Yungelson & Livio (2000) 
SG-ELD model, previously shown in Fig. 6. The convolution of this 
DTD with an extended exponential SFH cannot improve the fit to 
the SN rates, if the iron constraints on the normalization are to be 
simultaneously satisfied. 

1 Gyr the time of initial star formation to tf = 1.1 Gyr, 
corresponding to Zf = 5. Nonetheless, in every case, the 
fit with an extended SFH is worse than in the case of an 
instantaneous burst. 

These results are illustrated with two examples. 
Figure 10 shows the predictions of a minimal-iron- 
normalized t power-law DTD, with the exponential 
SFH starting at zj = 5, and several characteristic ex- 
ponential times T s f. Also shown, for comparison, is the 
Zf = 3 instantaneous burst, shown before in Fig 4. At 
late times, all the predictions have similar slopes, while at 
progressively shorter times, the rate dependence is shal- 
lower for the more extended SFHs. This leads to pro- 
gressively greater overprediction of the observed rates. 
Formally, r s / > 2.7 Gyr is ruled out, based on Xr > 2. 
Conversely, if we ignore the iron constraints and find 
the best-fit normalization, then the integrated number 
of SNe la is significantly lower than required by the min- 
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imal constraint (e.g., by a factor 2, for r s / = 3 Gyr). 
In Fig. 11, we show the same exercise for the SG-ELD 
model of Yungelson & Livio (2000). It might have been 
hoped that the convolution with an extended SFH could 
moderate the steep fall at late delays of this DTD, and 
thus provide a better fit to the observed weak time de- 
pendence of the rates. Although the steep fall, predicted 
in the instantaneous burst case, is indeed moderated, 
the smoothing at short delays lowers the contribution 
from early times to the integrated SN numbers, forcing 
a higher overall normalization and a poor fit. 

To summarize this section, the moderate slope of the 
observed SN rate redshift dependence at < z < 
1.4., combined with the large time-integrated number of 
SNe la indicated by the iron abundance, together call for 
a DTD that is sharply peaked at short delays, but with a 
low tail out to long delays. Convolution of any of the few 
single-DTD models that satisfy these constraints with 
any simple SFH that is extended on timescales > 1 Gyr 
only degrades the fits. 

7. COMPARISON TO DTD PREDICTIONS - 
TWO-COMPONENT MODELS 

We now examine to what degree the challenges of re- 
producing the observations can be overcome by the addi- 
tion of free parameters that is implicit in the combination 
of multiple components - either two DTDs (as could be 
expected from the co-existence of two distinct physical 
SN la channels, e.g., DD and SD), or two components of 
cluster SFH, as opposed to the single bursts assumed so 
far. 

7.1. Double DTD models 

The idea of two co-existing SN la channels, prompt 
and delayed, emerged several years ago from the obser- 
vation of, on the one hand, a proportionality between 
star-formation rate (SFR) and SN la rate per unit stellar 
mass in star forming galaxies, and on the other hand, a 
non-zero SN la rate in early-type galaxies with no current 
star formation (Mannucci et al. 2005, 2006; Scannapieco 
& Bildsten 2005; Sullivan et al. 2006). As noted in §1, 
this observation does not necessarily imply the existence 
of two separate physical channels. Instead, it could be 
the manifestation of a DTD from a single channel but 
with delays spread over a wide range of timescales. In- 
deed, our analysis, above, shows that single parametrized 
DTDs of power-law form, with indices somewhat steeper 
than — 1 can match the observational constraints (as can 
some DD models that produce DTDs of this type, if they 
are suitably scaled up). However, some of the other the- 
oretical DTDs, that individually are incompatible with 
the data because they predict no delayed SNe, can be 
"saved" by incorporating them into a two-channel pic- 
ture. 

For example, two of the models of Yungelson & Livio 
(2000), He-ELD and the SG-Ch, predict no SNe at long 
time delays. Figure 12 shows the predicted cluster rates 
from the combination of the He-ELD model and the DD 
model, discussed in § 5.2. The raw sum of these two 
models, as before, produces a too-small fraction (less 
than half) of the total number of SNe la indicated by 
the minimal iron constraint, and underpredicts all the 
SN rates below z < 0.5 (by a factor of 5). However, as 
seen in Fig. 12, scaling up this composite DTD by factors 
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Fig. 12. — Predicted SN rate dependence for a composite DTD 
model, combining the He-ELD and DD models of Yungelson & 
Livio (2000). Dotted curves show the raw predictions of these 
models, and lower solid line is their sum. The raw sum under- 
predicts the low-redshift rates by a factor of 5 (as was the case 
for this DD model alone, Fig. 5), and the integrated SN number 
by 2.1 (minimal iron) to 3.6 (optimal iron). Scaled-up versions of 
these models do give a satisfactory fit to all the observations, as 
indicated. 

of 2.1 or 3.6 can solve these problems for the minimal and 
optical iron cases, respectively. In this example, 60% of 
the SNe (and the iron) are from the prompt He-ELD SD 
component, and the rest from the DD component. 

Similar combinations of two components can work us- 
ing the other DTDs we have considered, as long as one of 
the components is a DD model (or a similar power law) 
that can provide the SNe with long delays. In addition 
to the choice of components, one can choose the rela- 
tive scaling between them, providing a further adjustable 
parameter. The current data of course cannot discrim- 
inate between the various possible prompt components 
as they make their contributions beyond the redshifts at 
which rate measurements exist. Our experiments at com- 
bining different DTD are obviously not exhaustive, but 
the emerging picture is nonetheless clear. The observed 
SN rates and the iron mass in clusters can be explained 
simultaneously by combining "prompt" and "delayed" 
DTDs. The SNe la from the prompt component produce 
the majority of the iron mass in clusters. The SNe from 
the delayed component produce only a fraction of the 
metals, and they are the events detected by current SN 
rate measurements, with their weak time dependence. 

7.2. Composite star formation histories 

A final scenario we examine is that of a single DTD, but 
with a composite cluster SFH, consisting of a short star- 
burst, beginning at Zf, combined with a more extended 
SFR. With the additional free parameters introduced in 
this scheme, it is easy to find combinations of DTDs 
with such composite SFHs that reproduce the SN la rate 
versus time, while simultaneously providing a sufficient 
time-integrated number of SNe to produce the observed 
ratio of iron mass to stellar mass. Figure 13 shows an 
example. Here, we have used the a = 1, j3 = 1 SD model, 
discussed previously in §5.2, from the BPS calculations 
ofMennekensetal. (2010). We recall that this DTD pro- 
duces only SNe la with short delays. In the context of 
the present exercise, this is desirable, as the observed SN 
rate dependence in generated by the SFH rather than by 
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Fig. 13. — Same as Fig. 12, but with a single, prompt, DTD, and 
a composite SFR - an exponentially decaying burst at Zf =3, with 
characteristic times as labeled, and a constant "DC" component of 
star formation, with a ratio between the components of 200:1 at 
2 = 3. The DTD is the a = 1 SD model of Mennekens et al. (2010), 
previously shown if Fig. 7. Normalizations are minimal-iron for the 
short-timescale burst and optimal-iron for the two longer-timcscale 
bursts. As seen in the figure, such composite SFHs can match the 
rate observations and the iron constraints with a single, prompt, 
DTD. However, the constant and high level of star formation down 
to low redshifts is at odds with other observations of cluster galax- 
ies. 

the DTD. In this example, for the two SFH components 
we take the sum, S(t) = Si(t) + S2 (t), of an exponential 
SFR, Si(t) oc exp[— (t — tf)/Tsp] as in §6, and a constant 
SFR, S2(t) = const. The relative levels of the two com- 
ponents are adjusted to fit the iron-based normalization 
constraints. For this choice of functions, t$f < 4 Gyr is 
required. A more prolonged exponential component ei- 
ther overpredicts the high- z rates, or forces the constant 
component to overpredict the low- z rates. 

This scenario can, in principle, explain the presently 
discussed data in terms of "prompt" DTDs such as those 
from SD models, combined with residual star formation. 
However, it is at odds with many other observations of 
clusters. Star-formation activity in clusters avoids the 
cluster cores (< 1 Mpc), and increases progressively with 
radius (e.g., Hansen et al 2008; Porter et al. 2008; Bai 
et al. 2009; Saintonge et al. 2008; Loh et al. 2008). In 
contrast, all of the cluster-SN candidate hosts in Sharon 
et al. (2010) are found at projected distances < 0.7 
Mpc from the brightest cluster galaxy. In the sample 
of Barbary et al. (2010) all are within < 0.8 Mpc, or 
even < 0.5 Mpc if excluding the outermost one 6 . Fur- 
thermore, the SNe from the various surveys we analyze 
are almost always found in early-type, red-sequence (and 
hence apparently quiescent), galaxies. For example, in 
the sample of Gal- Yam et al. (2008), for 4 out of the 5 
cluster SNe la that have host galaxies, the galaxies are 
early-types, based on colors and spectra. In the sample 
of Sharon et al. (2010), 6 out of the 7 most likely cluster 
SN la hosts are red-sequence galaxies. In Barbary et al. 
(2010), this figure is 8 out of 9. Thus, the cluster SNe la, 
on which are based the rates that we analyze, are nei- 
ther at the locations nor in the types of galaxies where 
star-formation in clusters is actually observed to occur. 

6 It would be useful to study the dependence of cluster SN rates 
on radial distance from cluster cores. Such an analysis may be 
feasible, however, only with a larger sample of cluster SNe. 



Quantitatively, in the above version of the 
burst +constant SFH scenario, as shown in Fig. 13, 
for, e.g., tsf = 0.5, ~ 93% of the present-day cluster 
stellar mass is formed before z = 1, and the remaining 
7% is from the constant SFR between z = 1 and 0. A 
typical monitored stellar mass in the clusters we consider 
is ~ 10 13 M Q (see § 3.2) or, correcting to formed mass 
before stellar evolution mass losses, ~ 2 x 10 13 M Q . 
With a lookback time to z = 1 of 8 Gyr, the 7% fraction 
implies a constant SFR between z — and 1 of ~175 
M Q yr -1 , just in the central regions of clusters. 

By comparison, several recent studies (e.g., Krick et al. 
2009, Bai et al. 2009; Koyama et al. 2010) find SFRs, 
integrated over the galaxies within a cluster, between 
~ 10 and a few hundred M yr -1 . Some clusters thus do 
have the high integrated SFRs required in the scenario 
of a composite SFH + prompt DTD (ignoring for the 
moment that this SFR is not in the central, quiescent 
galaxies that are seen to host the SNe, but rather in the 
outer regions, e.g., Haines et al. 2009). However, the 
observed cluster SFR, normalized by cluster mass, rises 
steeply with redshift, as (1 + z) p , with p = 5.3 ± 1.2 
(Bai et al. 2009), p = 5.7111 (Haines et al. 2009), 
or p « 6 (Koyama et al. 2010). This corresponds to 
one or two orders of magnitude increase in SFR over 
the < z < 1 range, and is in contrast to the roughly 
constant mass-normalized SFR that is required in this 
range, in the composite SFH scenario, to reproduce the 
cluster SN rates. 

We also note that, if the DTD were universal in en- 
vironment and in time, and the SFH + DTD scenario 
were true, then it would apply to early-type galaxies in 
general. SNe la in all early types, whether in the field 
or in clusters, and at low or high redshifts, would be 
the prompt outcome of low levels of recent but unseen 
star-formation in these galaxies. However, a study by 
Foerster & Schawinski (2008) of the early-type hosts of 
SNe la argues against this possibility. Furthermore, the 
DTD reconstruction in nearby galaxies by Maoz et al. 
(2010) demonstrates a 4cr detection of a delayed SN la 
component, with delays of 2.4 < r < 13 Gyr. 

We therefore conclude that the scenario of a 
burst+constant SFH, combined with a prompt single- 
component DTD, is probably not a viable model. While 
it is capable of explaining the cluster SN rate data and 
the observed iron to stellar mass ratio, it is incompati- 
ble with other cluster observations. Those observations 
indicate that, although about 10% of the stellar mass 
of clusters formed at z < 1, this activity did not occur 
in the central, quiescent, galaxies hosting the SNe found 
by current surveys, and that the SFR falls steeply with 
cosmic time, in contrast to the flat SFH required to re- 
produce, under this scenario, the flat observed SN rate. 

8. CONCLUSIONS 

Measurements of galaxy-cluster SN la rates as a func- 
tion of cosmic time provide a particularly direct avenue 
to obtain the DTD of SNe la, with its implications for 
progenitor models and cosmic history. Such measure- 
ments can constrain both the functional form and the 
normalization of the DTD. We have combined recently 
completed measurements of cluster rates between z = 
and z = 1.45 with the latest results on the iron mass 
content of clusters, to recover the SN la DTD, and to 
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test a variety of model DTDs, from purely mathemat- 
ical parametrized forms, to DTDs obtained from more 
detailed physical considerations. 
Our main results are as follows. 

1. Assuming that the bulk of the stars in clusters formed 
with a normal IMF in a brief burst at Zf « 3, as in- 
dicated by optical spectroscopy of cluster galaxies, the 
SN la DTD can be directly recovered from the observa- 
tions. The resulting DTD peaks at the shortest delays 
probed, < t < 2.2 Gyr, decreases steeply at longer de- 
lays, and extends out to 11 Gyr. The recovered DTD 
agrees remarkably well, both in shape and in absolute 
normalization, with DTDs recently recovered using dif- 
ferent techniques, in different environments: in field el- 
lipticals (Totani et al. 2008); in nearby galaxies (Maoz 
et al. 2010); and in the Magellanic Clouds (Maoz & 
Badenes 2010). The current derivation is complemen- 
tary to previous ones in that it recovers the DTD over 
its full time range, with good time resolution at delays 
> 3 Gyr. The emerging picture is of a universal DTD 
that is not strongly dependent on environment or cosmic 
time, and can be well represented by a single power law, 
of index s ~ — 1. 

2. Comparing the data to theoretical or phenomenologi- 
cal DTDs in a forward-modeling process, the best para- 
metric description of the DTD is a power law, 

tt(t) » 0.01 SN yr^lO 10 M^ 1 (j^j (13) 

with index s = —1.1 ± 0.2 or s = —1.3 ± 0.2, depend- 
ing on whether we adopt a minimal-iron constraint or 
an optimal- iron constraint, respectively, based on clus- 
ter data. Single-component DTDs consisting of shal- 
lower power laws, such ^ oc i -1 / 2 , cannot simultaneously 
match the observed SN rates and the integrated SN la 
numbers dictated by the iron-to-stellar mass ratio. 

3. Physical DD models from the BPS simulations we 
have examined can match the observations, provided 
they are scaled up in numbers by factors of 5 — 8. On 
the other hand, SD models, on their own, fail because 
they do not produce SNe at late delays, as implied by 
the data. Our results thus provide strong support for 
the double-degenerate SN la progenitor scenario. 

4. The above conclusions are insensitive to the ex- 
act epoch of star formation in clusters, in the range 
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Zf = 2 — 5. The conclusions also do not change if one re- 
places the instantaneous starburst with an exponentially 
decaying SFH, except that the model fits deteriorate with 
increasing star-formation timescale, and so the range of 
acceptable models shrinks. 

5. Multi-component models, that either combine freely 
scaled versions of two DTDs (e.g. SD and DD), or two 
SFH models (such as a burst plus a constant) with a 
prompt DTD, have enough freedom to permit many com- 
binations that can match the existing data. However, 
apart from the loss of simplicity involved, such combina- 
tions face other problems. For the composite DTD mod- 
els, it remains to be seen if the required combinations 
and scalings are physically plausible. For the composite 
SFHs, a high SFR in the central regions of clusters, and 
which remains constant between < z < 1, is required, 
in conflict with observations of clusters at these redshifts. 

6. All of the successful DTD models produce just a frac- 
tion of the time- integrated numbers of SNe la at redshifts 
below 1.4. A clear prediction of these models is, there- 
fore, that SN surveys of clusters or proto-clusters at even 
higher redshifts, approaching the stellar formation red- 
shifts of the clusters, should reveal a sharply rising rate of 
SNe la. Alternatively, observation of a non-rising SN la 
rate would raise the possibility that the bulk of metals 
in clusters was produced by CC SNe from a top-heavy 
IMF that exploded even earlier in cluster environments. 
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